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Abstract — Coding over subsets (known as generations) rather 
than over all content blocks in P2P distribution networks and 
other applications is necessary for a number of practical reasons 
such as computational complexity. A penalty for coding only 
within generations is an overall throughput reduction. It has 
been previously shown that allowing contiguous generations to 
overlap in a head-to-toe manner improves the throughput. We 
here propose and study a scheme, referred to as the random annex 
code, that creates shared packets between any two generations 
at random rather than only the neighboring ones. By optimizing 
very few design parameters, we obtain a simple scheme that 
outperforms both the non-overlapping and the head-to-toe over- 
lapping schemes of comparable computational complexity, both 
in the expected throughput and in the rate of convergence of the 
probability of decoding failure to zero. We provide a practical 
algorithm for accurate analysis of the expected throughput of 
the random annex code for finite-length information. This algo- 
rithm enables us to quantify the throughput vs. computational 
complexity tradeoff, which is necessary for optimal selection of 
the scheme parameters. 

I. Introduction 

In randomized network coding applications, such as P2P 
content distribution 11!) or streaming, the source splits its 
content into blocks. For a number of practical reasons (e.g, 
computational complexity and delay reduction, easier synchro- 
nization, simpler content tracking), these blocks are further 
grouped into subsets referred to as generations, and, in the 
coding process at the source and within the network, only 
packets in the same generation are allowed to be linearly 
combined. A penalty for coding only within generations is 
an overall throughput reduction. The goal of this work is 
to develop a strategy which allows generations to overlap, 
and hence, to improve the throughput while maintaining the 
benefits brought up by introducing generations. 

Since the idea of coding over generations has been intro- 
duced by Chou et al. in ||2l, a number of issues concerning 
such coding schemes have been addressed. Maymounkov et 
al. studied coding over generations with random scheduling of 
the generation transmission in [3]. They referred to such codes 
as chunked codes. This random scheduling coding scheme has 
also been studied in our recent work |41 . The tradeoff between 
the reduction in computational complexity and the throughput 
benefits was addressed in |[3l, ID, and the tradeoff between 
the reduction in computational complexity and the resilience 
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to peer dynamics in P2P content distribution networks was the 
topic of H. 

Here, we are particularly interested in recovering some of 
the throughput that is lost as a consequence of coding over 
generations by allowing the generations to overlap. It has 
been observed that with random scheduling, some generations 
accumulate their packets faster, and can be decoded earlier 
than others. If generations are allowed to overlap, the decoding 
of some generations will reduce the number of unknowns in 
those lagging behind but sharing packets with those already 
decoded, and thus help them become decodable with fewer 
received coded packets, consequently improving the through- 
put. Coding with overlapping generations was first studied in 
m and 0. 

In this work, we propose a coding scheme over generations 
that share randomly chosen packets. We refer to this code 
as the random annex code. We demonstrate with both a 
heuristic analysis and simulations that, assuming comparable 
computational complexity, with its small number of design pa- 
rameters optimized, the simple random annex coding scheme 
outperforms the non-overlapping scheme, as well as a "head- 
to-toe" overlapping scheme. The head-to-toe scheme was the 
topic of f7\, in which only contiguous generations overlap in a 
fixed number of information packets in an end-around mode, 
and hence only contiguous generations can benefit from each 
other in decoding. 

The main contribution of this paper is an accurate and 
practical evaluation of the expected throughput performance 
of the random annex code for finite information lengths. We 
first anatomize the overlapping structure of the generations to 
quantify the benefit of previously decoded generations on those 
not yet decoded, namely, how many fewer coded packets are 
needed for each generation to be decoded. With this accom- 
plished, we are then able to place the overlaps in oblivion and 
analyze the throughput performance of random annex codes 
under the coupon collection setting. We succeed to optimize 
the scheme design parameters based on the evaluation of the 
overhead necessary to decode all the packets. 

Our paper is organized as follows: In Section|IIl we describe 
the coding scheme for the random annex code, and introduce 
the coupon collector's model under which our analysis for 
code throughput is studied. In Section Hill we analyze the 
overlapping structure of the random annex code, and give a 
practical algorithm to evaluate the expected code performance 



for finite information length. In Section |IV| we present our 
numerical evaluation and simulation results which illustrate 
and quantify the throughput vs. computational complexity 
tradeoff brought up by our scheme. Section FV] concludes. 

II. Coding over Generations 

A. Coding over Overlapping Generations in Unicast 

We refer to the code we study as the random annex code. 
This section describes (a) the way the generations are formed, 
(b) the encoding process, (c) the decoding algorithm, and (d) 
how the computational complexity for the random annex code 
is measured. 

a) Forming Overlapping Generations: We first divide 
file into N information packets pi,p2, ■ ■ ■ ,pn- Each packet 
Pi is represented as a column vector of d information symbols 
in Galois field GF{q). We then form n overlapping genera- 
tions in two steps as follows: 

1) We partition the N packets into sets i?2, • ■ • , 

of each containing h consecutive packets. We refer to 
these n — N/h sets as base generations. Thus, Bi = 
{P{i-i)h+i,P(i-i)h+2, ■ ■ ■ ,Pih} for i = l,2,...,ri. We 
assume that is a multiple of h for convenience. In 
practice, if N is not a multiple of h, we set = iN/fi] 
and assign the last [N — {n — l)h] packets to the last 
(smaller) base generation. 

2) To each base generation Bj, we add a random annex 
Ri, consisting of / packets chosen uniformly at random 
(without replacement) from the N—h = (n—l)/i packets 
in J-\Bi. The base generation together with its annex 
constitutes the extended generation Gj = Bi U Ri. The 
size of each Gi is g = h + I. Throughout this paper, 
unless otherwise stated, the term "generation" will refer 
to "extended generation" whenever used alone. 

The members of Gi are enumerated as pj'-* , p^'' , ■ . ■ , p^g^ . 

b) Encoding: The encoding process is oblivious to over- 
laps between generations. In each transmission, the source 
first selects one of the n generations with equal probability. 
Assume Gj is chosen. Then the source chooses a coding 
vector e = [ei, 62, . . . , e^]^ with each entry chosen indepen- 
dently and equally probably from GF{q). A new packet p 
is then formed by linearly combining packets from Gj by 
e: p ~ J2i=i ^ipf'' ■ The coded packet p is then sent over 
the communication link to the receiver along with the coding 
vector e and the generation index j. 

c) Decoding: Decoding starts with a generation for 
which the receiver has collected (h + I) coded packets with 
linearly independent coding vectors. The information packets 
making up this generation are decoded by solving a system 
of {h + I) linear equations in GF{q) formed by the coded 
packets and the linear combinations of the information packets 
by the coding vectors. Each decoded information packet is 
then removed as an unknown from other generations' coded 
packets in which it participates. Consequently, the number 
of unknowns in all generations overlapping with those that 
are already decoded is reduced, and some such generations 



may become decodable even when no new coded packets 
are received from the source. Again, the newly decoded 
generations resolve some unknowns of the generations they 
overlap with, which in turn may become decodable and so 
on. This is the mechanism through which the generation 
overlapping potentially improves the throughput. We declare 
successful decoding when all N information packets have been 
decoded. 

d) Computational Complexity: The computational com- 
plexity for encoding is 0{{h + l)d) (recall that d is the number 
of information symbols in each packet as defined in Part a)) 
per coded packet for linearly combining the (h+l) information 
packets in each generation. For decoding, the largest number 
of unknowns in the systems of linear equations to be solved is 
(h + l), and therefore the computational complexity is upper 
bounded by 0{{h + + {h + l)d) per information packet. 

While some may argue that the assumption of random 
scheduling of generations deviates from reality, we put for- 
ward here a few motives behind its adoption: (1) Locality: 
Uniformly random scheduling assumes knowledge of least 
information, which to some extent approximates the case 
with limited-visioned peer nodes in large-scale systems; (2) 
Ratelessness: in the case of single-hop multicast over erasure 
channels, the code throughput automatically adapts to all era- 
sure rates. Some previous works on coding over generations, 
such as ID and Q, also assumed random scheduling. 

We measure code throughput by the number of coded 
packets necessary for decoding all the information packets. 

B. Coupon Collector's Problem 

As in 14J, we model the collection of coded packets from 
n generations as the sampling of a set of n coupons with- 
out replacement. In the next section, we will look into the 
overlapping structure of the extended generations, and use our 
extension of the coupon collector's brotherhood problemfS), 
||9l described in ID to evaluate the throughput performance 
of the random annex code. We will also compare the per- 
formance of random annex code to the overlapped chunked 
code proposed in I?), which has the generations overlap in 
a "head-to-toe" fashion. Note that the random annex code in 
fact defines a code ensemble that encompasses the overlapped 
chunked code. 

III. Overlapping Generations-An Analysis of 
Expected Performance 

A. Overlapping Structure 

The decoding of different generations becomes intertwined 
with each other as generations are no longer disjoint. Our goal 
here is to unravel the structure of the overlapping generations, 
in order to identify the condition for successful decoding of 
random annex codes over a unicast link. 

In Claims [T] through H] we study the overlapping structure 
of the random annex code. Compared with the head-to-toe 
overlapping scheme, an extended generation in the random 
annex code overlaps more evenly with other generations. 



Intuitively, this can help with code throughput when random 
scheduling of generations is used. 

Claim 1: For any packet in a base generation Bk, the 
probabihty that it belongs to annex Rr for some r e 
{l,2,...,n}\{fc} is 



N -h-1 



N -h 



I 



l-l J'\ I J N-h {n-l)h 

while the probability that it does not belong to i?^ is tt = I — tt. 

Claim 2: Let X be the random variable representing the 
number of generations an information packet participates in. 
Then, X = 1 + Y where Y is Binom(77 — 1, tt). 

I 



and 



E[X] = 1 + (n - l)7r = 1 + - 

n 



Var[X] = {n- l)nTt. 



Claim 3: In any generation of size g = h + I, the expected 
number of information packets not present in any other gener- 
ation is /iTT^"^^'' « /le^'/'' for ti ^ 1. The expected number 
of information packets present at least once in some other 
generation is 



/ + h[l 



l + h 



< miiijg, 21} 



for n > 1 and I > 0. 

Claim 4: The probability that two generations overlap is 1 — 

(i ; N~2h-2i) I (^r'') ■ ™y given generation, the number 
of other generations it overlaps with is then 



Binom n — 1, 



1 - 



X -2h 
LLX ~2h- 



21 



X -h 



The following Theorem |5] gives the expected overlap size 
r2(s) between the union of s generations and an (s + l)th 
generation. 

Theorem 5: For any / C {1,2,..., 71} with |/| — s, and 
any j e {1, 2, . . . , 

n{s) = E[\ (Ue/G,) n |] = g ■ [1 - Tf"*] + sh ■ tttt'' (1) 

where \B\ denotes the cardinality of set B, and tt, tt are as 
defined in Claim [T] 

When n ^ 00, if I; ^ a and ^ and let w(/3) = 17(s), 
then, w(/3) ~^h[{l + a) (l - e ""'3) + a^e-"'^] . 
We provide a proof of Theorem [5] in Appendix [X] 

B. An Analysis of Overhead Based on Mean Values 

We next describe an analysis of the expected number of 
coded packets a receiver needs to collect in order to decode 
all X information packets of when they are encoded by 
the random annex code. We base our analysis on Theorem |5] 
above and Claim |6] and Theorem [7] below, and use the mean 
value for every quantity involved. 

By the time when s{s ~ 0,1, ... ,n — 1) generations have 
been decoded, for any one of the remaining {n~s) generations, 
on the average ^l{s) of its participating information packets 
have been decoded, or (g — 51(s)) of them are not yet resolved. 



If the coded packets collected from some generation are 
enough for decoding its unresolved packets, that generation 
becomes the (s + l)th decoded one; otherwise, if no such 
generation exists, decoding fails. 

The following Claim |6] estimates the number of coded 
packets needed from a generation for its decoding when (g—x) 
of its information packets remain to be resolved. 

Claim 6: For any generation G^, if x of the g = h + I 
information packets of Gi have been resolved by decoding 
other generations, then, the expected value of the number 
of coded packets Xi{g,x) from Gi needed to decode the 
remaining (g — x) information packets 



E[X,{g,x)]<g-x + 



1 



i-q- 



(2) 

Proof: In all the coding vectors, remove the entries 
corresponding to the information packets already resolved. 
Now we need to solve a system of linear equations of (g — x) 
unknowns with all coefficients chosen uniformly at random 
from GF(q). Thus, X,{g,x) = X,{g - x,0). 



9-^-^ / a-x 7 \ -1 

gf - 



E[X.{g,x)]= J2 > . 

j=o ^ ^ 



:dy 



1 



< 



1 - qy-g+^ " 1 - g"^ 



9-X+ — + log — — 

1 — g^ ^ 1— g^ 



(3) 

(4) 
(5) 



Extending the domain of rj{x) from integers to real numbers, 
we can estimate that the number of coded packets needed 
for the (s + l)th decoded generation should exceed m'^ = 
\ri{rt{s))~\. Since in the random annex code, all generations 
are randomly scheduled with equal probability, for successful 
decoding, we would like to have at least ttiq coded packets 
belonging to one of the generations, at least m'j belonging to 
another, and so on. We wish to estimate the total number of 
coded packets needed to achieve the above. 

For any in G N, we define Sm{x) as follows: 



X X 

Sm{x) =1 + Y? + 2r 



(to - 1)! 
Soo{x) =exp(x) and So{x) = 0. 



(to > 1) (6) 
(7) 



The following Theorem |7] which is a restatement of Theorem 
2 from in using the terminology of coding over generations, 
provides a way to estimate the expected number of coded 
packets necessary for decoding the whole file 

Theorem 7: (Theorem 2, ||4]) Suppose for some ^ G N, 
integers fci, . . . , fc^ and toi, . . . , niA satisfy 1 < fci < • • • < 
kA < n and 00 ~ nia > toi > • • • > niA > mA+i = 0. Let 
fir be the number of generations for which at least r coded 
packets have been collected. Then, the expected number of 
coded packets necessary to simultaneously have > kj for 



all j = 1, 2, . . . , A is 



(8) 



E n 

(n.'2 ^a)^ j=0 

•0=0 i>A-|-l='» 
j = l,2,...,A 

A practical method to evaluate ^ is provided in Appendix 
IB] The computational complexity for one evaluation of the 
integrand is 0{An^) given mi = 0{An^). 

The algorithm for our heuristic analysis is listed as follows: 

1) Compute n{s — 1) for s = 1,2, ... ,n using Theorem|5j 

2) Compute m'^ = [?y(0(.s — 1))] for s ~ 1,2, . . . ,n using 
© from Claim |5J 

Map mg(s = l,2,...,Ti) into A values — 



3) 



1,2, 



. . , A) SO that rrij 
m'l^., for j = 1, 2, . . . , A, mi > m2 > • • • > > 
niA+i = 0, fco = and fc^ = n; 
4) Evaluate (O in Theorem |7] with the A, kjS, and m^s ob- 
tained in StepO, as an estimate for the expected number 
of coded packets needed for successful decoding. 

Remark 1: The above Step [3]l is viable because n{s) is 
nondecreasing in s, the righthand side of ^ is non-increasing 
in X for fixed g, and thus m'^ is non-increasing in s. 

Although our analysis is heuristic, we will see in the next 
section that the estimate closely follows the simulated average 
performance curve of the random annex coding scheme. 

IV. Numerical Evaluation and Simulation Results 

A. Throughput vs. Complexity in Fixed Number of Genera- 
tions Schemes 

Our goal here is to find out how the annex size / affects 
the throughput performance of the scheme with fixed base 
generation size h and the total number of information packets 
N (and consequently, the number of generations n). Note 
that we may be trading throughput for complexity since the 
generation size g = h + l affects the computational complexity 
of the scheme. 

Figure [T] shows the analytical and simulation results when 
the total number N of information packets is 1000 and the base 



generation size h is 25. Figure [ Jla)| shows h + l — fl{s) for s — 
0, 1, . . . , n with different annex sizes. Recall that il{s) is the 
expected size of the overlap of the union of s generations with 
any one of the rest n — s generations. After the decoding of s 
generations, for any generation not yet decoded, the expected 
number of information packets that still need to be resolved 
is then h + l — Q.{s). We observe that the h + l — Q.{s) curves 
start from h + I for s = and gradually descends, ending 
somewhere above h — I, for s ~ n — 1. 

Recall that we measure throughput by the number of coded 



packets necessary for successful decoding. Figure [l |lb)| shows 
the expected performance of the random annex code, along 
with the head-to-toe overlapping code and the non-overlapping 



code(Z = 0). Figure [ JJc)! shows the probability of decoding 
failure of these codes versus the number of coded packets 
collected. 
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Our analysis for the expected number of coded packets re- 
quired for successful decoding extremely closely matches 
the simulation results. 

For both the random annex scheme and the head-to- 
toe scheme, there is an optimal annex size, beyond or 



below which throughput is lower than optimal. From the 



simulation results in Figure [l Ub)[ it is observed that the 
optimal annex size is 12 for the random annex scheme 
and 8 for the head-to-toe scheme. Beyond the optimal 
annex size, throughput cannot be increased by raising 
computational cost. 

• The random annex code outperforms head-to-toe over- 
lapping at their respective optimal points. Both codes 
outperforms the non-overlapping scheme. 

• We also plotted the probability of decoding failure versus 
the number of coded packets received. The probability 
of decoding failure of the random annex code con- 
verges faster than those of the head-to-toe and the non- 
overlapping scheme. 

B. Enhancing Throughput in Fixed Complexity Schemes 

Our goal here is to see how we can choose the annex 
size that optimizes the throughput with negligible sacrifice in 
complexity. To this end, we fix the extended generation size 
g ^ h + I and vary only the annex size I. Consequently, the 
computational complexity for coding remains roughly constant 
(actually decreases with growing /). 

Figure |2] shows the analytical and simulation results for the 
code performance when the total number N of information 
packets is fixed at 1000 and size g of extended generation 
fixed at 25. 

• Again our analytical results agree with simulation results 
very well; 

• It is interesting to observe that, without raising com- 
putational complexity, increasing annex size properly 
can still give non-negligible improvement to throughput. 
There is still an optimal annex size that achieves highest 
throughput. From Figure | 3Ia)[ we see that the optimal 
annex size is 10 for the random annex scheme and 6 for 
the head-to-toe scheme; 

• The random annex code again outperforms head-to-toe 
overlapping at their optimal points. Both codes outper- 
form the non-overlapping scheme; 

• We again observe that the probability of decoding failure 
of the random annex code converges faster than those of 
the head-to-toe and the non-overlapping schemes. 

V. Conclusion And Future Work 

We proposed a random annex scheme for coding with 
overlapping generations. We obtained an accurate analytical 
evaluation of its expected throughput performance for a unicast 
link using an extension of the coupon collector's model derived 
in our recent work ID. Both the expected throughput and the 
probability of decoding failure of the random annex code are 
generally better than those of the non-overlapping and head- 
to-toe overlapping coding schemes. Under fixed information 
length and fixed number of generations, there exists an optimal 
annex size that minimizes the number of coded packets needed 
for successful decoding. One of our most interesting findings 
is that when we fix the information length and the generation 
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size, increasing the annex size may still improve throughput 
without raising computational complexity. 

We developed a practical algorithm to numerically evalu- 
ate some of our complex analytical expressions. With slight 
modification of the analytical method used in this work, we 
can also predict the expected decoding progress, i.e., the 
number of generations/information packets decodable with the 
accumulation of coded packets. This will be useful to studies 
of content distribution with tiered reconstruction at the user. It 
can also be used to find the best rate of a "precode" 131 applied 
to coding over generations. It would be interesting to know if 
the combination of overlapping generations and precode can 
further improve code throughput. 

It is also interesting to study the asymptotic performance 
of the code, as the information length tends to infinity. We 
also hope to characterize the optimal annex size in terms of 
generation size and number of generations. 

Appendix A 
Proof of Theorem[5] 

Without loss of generality, let / = {1,2, 
s + 1, and define 7^,, = Uf^^i?;, 6, 



• , s} and j 
and Qs 



Uf^iG, for s = 0, 1, . . . ,n - 1. Then, E [\ (U,e/G,) n Gj\] = 
E [\Qs n Gs+il]. For any two sets X and Y, we use X + Y 
to denote XUY when X n F = 0. 

n Gs+i ={Bs + ns\Bs) n + ii-.+i) 

And therefore 

£;[|a.nG,+i|] =i;[|B,ni?,,+i|]+ (9) 

E[\TZs n B.+i |] + E[\{TZs\Bs) D Rs+i \] 

Using Claim [l] we have 

E[\BsnRs+i\]^ shn, (10) 
E[\n, n B,+i\] = h[l - {1 - 7:)% (11) 
E[\{ns\Bs) n Rs+i\] ^{n-s- l)hn[l - (1 - ^)^], (12) 

where tt is as defined in Claim [T] Bringing (fT0]i-(fT2]i into (|9]l 
we obtain ([T]i. 

Furthermore, when 71 — s> co, if ^> a and s/n ^ (3, then 



>h{l + a 
hal3(l - 



1-1- 



n - 1 



n — 1 

(l + a)(l-e-"^)+a/3e-"^ 
1 + a - (1 + a - a/3)e-"'' 



Appendix B 
Evaluation of Expression ^ 

We give here a method to calculate the integrand in ^ of 
Theorem I2I The integrand of (O can be rewritten as 

1- E Q^)[(^m.(x)-5™,^,(x))e-n— (13) 



iA=kA 

E 

^2 / ■ 
«2 



M-1 



[{SniA-ii^) 'S'm^(a;))e 



n=fei 



»0 



It is not hard to find an algorithm that calculates 1 — (/)^.„(a;) 
in (ciTOi+C2(n-l)+C3 I]fc=/cj+i ^ime, where 

ci, C2 and C3 are positive constants. As long as mi = 0{An^), 
we can estimate the amount of work for a single evaluation 
of the integrand of (|6]l in Theorem |7] to be 0{An^). 

The integral can be computed through the use of some 
efficient quadrature method, for example, Gauss-Laguerre 
quadrature. For reference, some numerical integration issues 
of the special case where A ~ 1 have been addressed in Part 
7 of HOl and in ITTI . 
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For fc = /ci , fci + 1, . . . , n, let 
For each j = 1, 2, . . . , A, let 

W — kj ^ ^ 

for k = fcj+i , -j- 1, . . . , n. 



Then, one can verify that ( flJl l is exactly 1 — (j)A,n{x)- 



